Classifying Genomic Sequences by Sequence Feature Analysis
نویسندگان
چکیده
Traditional sequence analysis depends on sequence alignment. In this study, we analyzed various functional regions of the human genome based on sequence features, including word frequency, dinucleotide relative abundance, and base-base correlation. We analyzed the human chromosome 22 and classified the upstream, exon, intron, downstream, and intergenic regions by principal component analysis and discriminant analysis of these features. The results show that we could classify the functional regions of genome based on sequence feature and discriminant analysis.
منابع مشابه
Complete Genomic Sequence of a Strain of Tomato Yellow Leaf Curl Virus from Iran
Background and Aims: Tomato yellow leaf curl virus (TYLCV) is one of the most destructive viruses of tomato that leads to reduced tomato yield up to 100% in tropical and subtropical regions. In this study, the complete sequence of TYLCV isolate from Hormozgan province, Iran and its recombination evsent was determined. Methods: TYLCV infected tomato was collected from Hormozgan province. Total D...
متن کاملMolecular phylogeny of some avian species using Cytochrome b gene sequence analysis
Veritable identification and differentiation of avian species is a vital step in conservative, taxonomic, forensic, legal and other ornithological interventions. Therefore, this study involved the application of molecular approach to identify some avian species i.e. Chicken (Gallus gallus), Muskovy duck (Cairina moschata), Japanese quail (Coturnix japonica), Laughing dove (Streptopelia senegale...
متن کاملListeria Monocytogenes La111 and Klebsiella Pneumoniae KCTC 2242: Shine-Dalgarno Sequences
Listeria monocytogenes can cause serious infection and recently, relapse of listeriosis has been reported in leukemia and colorectal cancer, and the patients with Klebsiella pneumoniae are at increased risk of colorectal cancer. Translation initiation codon recognition is basically mediated by Shine-Dalgarno (SD) and the anti-SD sequences at the small ribosomal RNA (ssu rRNA). In this research,...
متن کاملCloning and Characterization of cbhII Gene fromTrichoderma parceramosum and Its Expressionin Pichia pastoris
The genomic and cDNA clones encoding cellobiohydrolase II (CBHII) have been isolated and sequenced from a native Iranian isolate of Trichoderma parceramosum, a high cellulolytic enzymes producer isolate. This represents the first report of cbhII gene from this organism. Comparison of genomic and cDNA sequences indicates this gene contains three short introns and also an open reading frame codin...
متن کاملSequence analysis of ORF94 in different White Spot Syndrome Virus (WSSV) isolates of Iran
White spot syndrome virus (WSSV) is a pathogen that causes high mortality in shrimp culture in the whole world. Sequence analysis of WSSV has shown similarity of WSSV isolates in different countries with exception of a few variable genomic loci. This study investigated the sequence variation of some Iranian WSSV isolates and previously identified isolates. Samples were collected during target ...
متن کامل